Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 10437 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.4 MiB |
| Average record size in memory | 138.0 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 1 |
modular_ratio is highly correlated with ratio | High correlation |
weight is highly correlated with peak_number | High correlation |
peak_number is highly correlated with weight | High correlation |
ratio is highly correlated with modular_ratio | High correlation |
modular_ratio is highly correlated with ratio | High correlation |
weight is highly correlated with peak_number | High correlation |
peak_number is highly correlated with weight | High correlation |
ratio is highly correlated with modular_ratio | High correlation |
modular_ratio is highly correlated with ratio | High correlation |
ratio is highly correlated with modular_ratio | High correlation |
intercolumnar_distance is highly correlated with row_number | High correlation |
upper_margin is highly correlated with row_number and 1 other fields | High correlation |
lower_margin is highly correlated with row_number | High correlation |
row_number is highly correlated with intercolumnar_distance and 5 other fields | High correlation |
modular_ratio is highly correlated with ratio | High correlation |
interlinear_spacing is highly correlated with row_number and 1 other fields | High correlation |
weight is highly correlated with peak_number | High correlation |
peak_number is highly correlated with row_number and 2 other fields | High correlation |
ratio is highly correlated with modular_ratio and 1 other fields | High correlation |
class is highly correlated with upper_margin and 2 other fields | High correlation |
Reproduction
| Analysis started | 2022-09-03 01:21:35.013007 |
|---|---|
| Analysis finished | 2022-09-03 01:21:53.504450 |
| Duration | 18.49 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
| Distinct | 143 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.0008518787966 |
| Minimum | -3.498799 |
|---|---|
| Maximum | 11.819916 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 4330 |
| Negative (%) | 41.5% |
| Memory size | 81.7 KiB |
Quantile statistics
| Minimum | -3.498799 |
|---|---|
| 5-th percentile | -0.597995 |
| Q1 | -0.128929 |
| median | 0.056229 |
| Q3 | 0.204355 |
| 95-th percentile | 0.698109 |
| Maximum | 11.819916 |
| Range | 15.318715 |
| Interquartile range (IQR) | 0.333284 |
Descriptive statistics
| Standard deviation | 1.008551356 |
|---|---|
| Coefficient of variation (CV) | -1183.914143 |
| Kurtosis | 40.06528989 |
| Mean | -0.0008518787966 |
| Median Absolute Deviation (MAD) | 0.172814 |
| Skewness | 2.561404913 |
| Sum | -8.891059 |
| Variance | 1.017175837 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| -3.498799 | 315 | 3.0% |
| 0.080916 | 311 | 3.0% |
| 0.15498 | 292 | 2.8% |
| 0.019197 | 277 | 2.7% |
| -0.042522 | 250 | 2.4% |
| 0.117948 | 247 | 2.4% |
| 0.130292 | 244 | 2.3% |
| -0.128929 | 235 | 2.3% |
| 0.068573 | 233 | 2.2% |
| 0.142636 | 228 | 2.2% |
| Other values (133) | 7805 |
| Value | Count | Frequency (%) |
| -3.498799 | 315 | |
| -3.486455 | 8 | 0.1% |
| -3.461768 | 4 | < 0.1% |
| -3.43708 | 4 | < 0.1% |
| -3.412392 | 11 | 0.1% |
| -3.054421 | 10 | 0.1% |
| -2.807544 | 9 | 0.1% |
| -2.573011 | 12 | 0.1% |
| -2.523635 | 12 | 0.1% |
| -2.47426 | 6 | 0.1% |
| Value | Count | Frequency (%) |
| 11.819916 | 8 | |
| 9.943651 | 13 | |
| 9.52396 | 5 | < 0.1% |
| 8.314263 | 12 | |
| 5.759087 | 10 | |
| 4.96908 | 9 | |
| 4.524702 | 14 | |
| 4.462983 | 5 | < 0.1% |
| 3.722352 | 9 | |
| 3.265629 | 12 |
| Distinct | 204 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.003395689758 |
| Minimum | -2.426761 |
|---|---|
| Maximum | 19.470188 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 5940 |
| Negative (%) | 56.9% |
| Memory size | 81.7 KiB |
Quantile statistics
| Minimum | -2.426761 |
|---|---|
| 5-th percentile | -0.613138 |
| Q1 | -0.259834 |
| median | -0.063555 |
| Q3 | 0.203385 |
| 95-th percentile | 0.572391 |
| Maximum | 19.470188 |
| Range | 21.896949 |
| Interquartile range (IQR) | 0.463219 |
Descriptive statistics
| Standard deviation | 0.9552571162 |
|---|---|
| Coefficient of variation (CV) | 281.314603 |
| Kurtosis | 170.3014356 |
| Mean | 0.003395689758 |
| Median Absolute Deviation (MAD) | 0.227684 |
| Skewness | 10.54813658 |
| Sum | 35.440814 |
| Variance | 0.9125161581 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| -0.189174 | 201 | 1.9% |
| -0.291239 | 175 | 1.7% |
| -0.220579 | 159 | 1.5% |
| -0.338346 | 152 | 1.5% |
| 0.289748 | 147 | 1.4% |
| -0.087108 | 146 | 1.4% |
| -0.134216 | 137 | 1.3% |
| -0.063555 | 135 | 1.3% |
| 0.014957 | 134 | 1.3% |
| -0.09496 | 133 | 1.3% |
| Other values (194) | 8918 |
| Value | Count | Frequency (%) |
| -2.426761 | 106 | |
| -2.395356 | 13 | 0.1% |
| -2.08916 | 13 | 0.1% |
| -1.963541 | 17 | 0.2% |
| -1.947839 | 16 | 0.2% |
| -1.916434 | 12 | 0.1% |
| -1.680899 | 14 | 0.1% |
| -1.649494 | 6 | 0.1% |
| -1.304042 | 18 | 0.2% |
| -1.296191 | 14 | 0.1% |
| Value | Count | Frequency (%) |
| 19.470188 | 4 | < 0.1% |
| 17.570202 | 4 | < 0.1% |
| 16.965662 | 2 | < 0.1% |
| 12.655362 | 7 | 0.1% |
| 10.65331 | 4 | < 0.1% |
| 9.65621 | 10 | |
| 7.293004 | 18 | |
| 4.835583 | 14 | |
| 3.846334 | 5 | < 0.1% |
| 2.880639 | 6 | 0.1% |
| Distinct | 230 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.00518116202 |
| Minimum | -3.210528 |
|---|---|
| Maximum | 7.458681 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 1700 |
| Negative (%) | 16.3% |
| Memory size | 81.7 KiB |
Quantile statistics
| Minimum | -3.210528 |
|---|---|
| 5-th percentile | -3.210528 |
| Q1 | 0.064919 |
| median | 0.217845 |
| Q3 | 0.356544 |
| 95-th percentile | 0.530808 |
| Maximum | 7.458681 |
| Range | 10.669209 |
| Interquartile range (IQR) | 0.291625 |
Descriptive statistics
| Standard deviation | 0.9924296863 |
|---|---|
| Coefficient of variation (CV) | 191.5457734 |
| Kurtosis | 10.59083647 |
| Mean | 0.00518116202 |
| Median Absolute Deviation (MAD) | 0.142256 |
| Skewness | -1.387287093 |
| Sum | 54.075788 |
| Variance | 0.9849166822 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| -3.210528 | 610 | 5.8% |
| 0.239183 | 148 | 1.4% |
| 0.388552 | 148 | 1.4% |
| 0.235627 | 124 | 1.2% |
| 0.214288 | 124 | 1.2% |
| 0.171611 | 119 | 1.1% |
| 0.349432 | 118 | 1.1% |
| 0.299642 | 116 | 1.1% |
| 0.107596 | 115 | 1.1% |
| 0.324537 | 114 | 1.1% |
| Other values (220) | 8701 |
| Value | Count | Frequency (%) |
| -3.210528 | 610 | |
| -3.206971 | 19 | 0.2% |
| -3.203415 | 27 | 0.3% |
| -3.075385 | 16 | 0.2% |
| -2.975805 | 6 | 0.1% |
| -2.958023 | 14 | 0.1% |
| -2.908234 | 14 | 0.1% |
| -2.481465 | 7 | 0.1% |
| -2.349878 | 12 | 0.1% |
| -2.324983 | 11 | 0.1% |
| Value | Count | Frequency (%) |
| 7.458681 | 3 | < 0.1% |
| 7.419561 | 7 | |
| 6.260173 | 6 | |
| 5.49199 | 8 | |
| 5.196809 | 4 | < 0.1% |
| 5.083004 | 10 | |
| 4.329047 | 5 | |
| 3.941399 | 6 | |
| 3.28702 | 8 | |
| 2.48683 | 2 | < 0.1% |
exploitation
Real number (ℝ)
| Distinct | 748 |
|---|---|
| Distinct (%) | 7.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.002615837597 |
| Minimum | -5.440122 |
|---|---|
| Maximum | 3.987152 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 4871 |
| Negative (%) | 46.7% |
| Memory size | 81.7 KiB |
Quantile statistics
| Minimum | -5.440122 |
|---|---|
| 5-th percentile | -1.790689 |
| Q1 | -0.526838 |
| median | 0.087408 |
| Q3 | 0.627208 |
| 95-th percentile | 1.388496 |
| Maximum | 3.987152 |
| Range | 9.427274 |
| Interquartile range (IQR) | 1.154046 |
Descriptive statistics
| Standard deviation | 0.9914428365 |
|---|---|
| Coefficient of variation (CV) | 379.0154395 |
| Kurtosis | 3.220171595 |
| Mean | 0.002615837597 |
| Median Absolute Deviation (MAD) | 0.581006 |
| Skewness | -0.8334254482 |
| Sum | 27.301497 |
| Variance | 0.982958898 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| -5.440122 | 37 | 0.4% |
| -0.184417 | 33 | 0.3% |
| -0.527256 | 30 | 0.3% |
| 0.557118 | 22 | 0.2% |
| 0.062642 | 22 | 0.2% |
| 1.199979 | 22 | 0.2% |
| 0.296423 | 22 | 0.2% |
| 0.14049 | 22 | 0.2% |
| 0.557894 | 22 | 0.2% |
| -0.755517 | 22 | 0.2% |
| Other values (738) | 10183 |
| Value | Count | Frequency (%) |
| -5.440122 | 37 | |
| -3.441837 | 4 | < 0.1% |
| -3.018853 | 12 | 0.1% |
| -2.9863 | 6 | 0.1% |
| -2.963951 | 5 | < 0.1% |
| -2.941364 | 8 | 0.1% |
| -2.832246 | 9 | 0.1% |
| -2.832037 | 2 | < 0.1% |
| -2.809808 | 5 | < 0.1% |
| -2.701227 | 9 | 0.1% |
| Value | Count | Frequency (%) |
| 3.987152 | 15 | |
| 2.791392 | 14 | |
| 2.258633 | 18 | |
| 2.211191 | 18 | |
| 2.123586 | 15 | |
| 2.046336 | 16 | |
| 2.04189 | 16 | |
| 2.021451 | 7 | 0.1% |
| 2.004055 | 15 | |
| 1.993015 | 7 | 0.1% |
| Distinct | 47 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.006365146306 |
| Minimum | -4.922215 |
|---|---|
| Maximum | 1.066121 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 1585 |
| Negative (%) | 15.2% |
| Memory size | 81.7 KiB |
Quantile statistics
| Minimum | -4.922215 |
|---|---|
| 5-th percentile | -1.436467 |
| Q1 | 0.17234 |
| median | 0.261718 |
| Q3 | 0.261718 |
| 95-th percentile | 0.976743 |
| Maximum | 1.066121 |
| Range | 5.988336 |
| Interquartile range (IQR) | 0.089378 |
Descriptive statistics
| Standard deviation | 1.007875799 |
|---|---|
| Coefficient of variation (CV) | -158.3429116 |
| Kurtosis | 13.99927347 |
| Mean | -0.006365146306 |
| Median Absolute Deviation (MAD) | 0.089378 |
| Skewness | -3.609422025 |
| Sum | -66.433032 |
| Variance | 1.015813626 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=47)
| Value | Count | Frequency (%) |
| 0.261718 | 5110 | |
| 0.17234 | 1966 | 18.8% |
| 0.976743 | 571 | 5.5% |
| 0.082961 | 515 | 4.9% |
| -4.922215 | 278 | 2.7% |
| 0.351096 | 179 | 1.7% |
| -0.006417 | 144 | 1.4% |
| -1.078955 | 123 | 1.2% |
| 0.887365 | 119 | 1.1% |
| -1.257711 | 106 | 1.0% |
| Other values (37) | 1326 | 12.7% |
| Value | Count | Frequency (%) |
| -4.922215 | 278 | |
| -4.832837 | 3 | < 0.1% |
| -4.743459 | 5 | < 0.1% |
| -4.654081 | 6 | 0.1% |
| -3.849677 | 9 | 0.1% |
| -3.313408 | 10 | 0.1% |
| -3.22403 | 18 | 0.2% |
| -3.134652 | 12 | 0.1% |
| -3.045274 | 25 | 0.2% |
| -2.777139 | 7 | 0.1% |
| Value | Count | Frequency (%) |
| 1.066121 | 27 | 0.3% |
| 0.976743 | 571 | 5.5% |
| 0.887365 | 119 | 1.1% |
| 0.797987 | 103 | 1.0% |
| 0.708609 | 95 | 0.9% |
| 0.61923 | 78 | 0.7% |
| 0.529852 | 27 | 0.3% |
| 0.440474 | 62 | 0.6% |
| 0.351096 | 179 | 1.7% |
| 0.261718 | 5110 |
| Distinct | 234 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.008885775031 |
| Minimum | -7.450257 |
|---|---|
| Maximum | 12.315569 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 5493 |
| Negative (%) | 52.6% |
| Memory size | 81.7 KiB |
Quantile statistics
| Minimum | -7.450257 |
|---|---|
| 5-th percentile | -1.429155 |
| Q1 | -0.598658 |
| median | -0.058835 |
| Q3 | 0.564038 |
| 95-th percentile | 1.643684 |
| Maximum | 12.315569 |
| Range | 19.765826 |
| Interquartile range (IQR) | 1.162696 |
Descriptive statistics
| Standard deviation | 1.000360272 |
|---|---|
| Coefficient of variation (CV) | -112.5799684 |
| Kurtosis | 4.823981764 |
| Mean | -0.008885775031 |
| Median Absolute Deviation (MAD) | 0.581347 |
| Skewness | 0.3201263528 |
| Sum | -92.740834 |
| Variance | 1.000720675 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.107265 | 217 | 2.1% |
| -0.307984 | 214 | 2.1% |
| -0.474083 | 211 | 2.0% |
| -0.058835 | 211 | 2.0% |
| 0.024215 | 208 | 2.0% |
| -0.349509 | 204 | 2.0% |
| -0.266459 | 203 | 1.9% |
| 0.190314 | 203 | 1.9% |
| -0.10036 | 203 | 1.9% |
| -0.01731 | 198 | 1.9% |
| Other values (224) | 8365 |
| Value | Count | Frequency (%) |
| -7.450257 | 3 | |
| -4.668092 | 1 | < 0.1% |
| -4.169795 | 1 | < 0.1% |
| -4.12827 | 2 | |
| -4.003695 | 1 | < 0.1% |
| -3.87912 | 1 | < 0.1% |
| -3.837595 | 1 | < 0.1% |
| -3.796071 | 2 | |
| -3.754546 | 3 | |
| -3.713021 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 12.315569 | 1 | |
| 7.996985 | 1 | |
| 5.048721 | 1 | |
| 4.965672 | 1 | |
| 4.924147 | 1 | |
| 4.633473 | 1 | |
| 4.550423 | 1 | |
| 4.508898 | 1 | |
| 4.384324 | 1 | |
| 4.342799 | 2 |
| Distinct | 229 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.002350016097 |
| Minimum | -11.935457 |
|---|---|
| Maximum | 4.901228 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 3101 |
| Negative (%) | 29.7% |
| Memory size | 81.7 KiB |
Quantile statistics
| Minimum | -11.935457 |
|---|---|
| 5-th percentile | -1.554093 |
| Q1 | -0.044076 |
| median | 0.220177 |
| Q3 | 0.446679 |
| 95-th percentile | 0.824183 |
| Maximum | 4.901228 |
| Range | 16.836685 |
| Interquartile range (IQR) | 0.490755 |
Descriptive statistics
| Standard deviation | 0.9668267617 |
|---|---|
| Coefficient of variation (CV) | 411.4128253 |
| Kurtosis | 34.62247159 |
| Mean | 0.002350016097 |
| Median Absolute Deviation (MAD) | 0.226503 |
| Skewness | -4.213841695 |
| Sum | 24.527118 |
| Variance | 0.9347539871 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.144676 | 483 | 4.6% |
| 0.295677 | 464 | 4.4% |
| 0.182426 | 463 | 4.4% |
| 0.220177 | 463 | 4.4% |
| 0.257927 | 450 | 4.3% |
| 0.333428 | 434 | 4.2% |
| 0.371178 | 425 | 4.1% |
| 0.446679 | 412 | 3.9% |
| 0.106925 | 403 | 3.9% |
| 0.069175 | 373 | 3.6% |
| Other values (219) | 6067 |
| Value | Count | Frequency (%) |
| -11.935457 | 10 | |
| -9.066425 | 1 | < 0.1% |
| -8.990925 | 3 | < 0.1% |
| -8.877673 | 1 | < 0.1% |
| -8.802173 | 1 | < 0.1% |
| -8.726672 | 2 | < 0.1% |
| -8.311418 | 1 | < 0.1% |
| -8.273667 | 1 | < 0.1% |
| -7.971663 | 1 | < 0.1% |
| -7.908746 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 4.901228 | 1 | < 0.1% |
| 4.18397 | 1 | < 0.1% |
| 3.995218 | 1 | < 0.1% |
| 3.466712 | 1 | < 0.1% |
| 3.24021 | 1 | < 0.1% |
| 3.164709 | 2 | |
| 2.938206 | 1 | < 0.1% |
| 2.900456 | 1 | < 0.1% |
| 2.862706 | 4 | |
| 2.787205 | 1 | < 0.1% |
| Distinct | 10103 |
|---|---|
| Distinct (%) | 96.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.0102593157 |
| Minimum | -4.090167 |
|---|---|
| Maximum | 4.580832 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 4709 |
| Negative (%) | 45.1% |
| Memory size | 81.7 KiB |
Quantile statistics
| Minimum | -4.090167 |
|---|---|
| 5-th percentile | -1.8542382 |
| Q1 | -0.547709 |
| median | 0.103541 |
| Q3 | 0.639426 |
| 95-th percentile | 1.3946236 |
| Maximum | 4.580832 |
| Range | 8.670999 |
| Interquartile range (IQR) | 1.187135 |
Descriptive statistics
| Standard deviation | 0.9964310194 |
|---|---|
| Coefficient of variation (CV) | -97.12451085 |
| Kurtosis | 1.187849053 |
| Mean | -0.0102593157 |
| Median Absolute Deviation (MAD) | 0.584753 |
| Skewness | -0.6359868806 |
| Sum | -107.076478 |
| Variance | 0.9928747765 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.632941 | 3 | < 0.1% |
| 0.303132 | 3 | < 0.1% |
| 0.545553 | 3 | < 0.1% |
| -0.430731 | 3 | < 0.1% |
| -0.225492 | 3 | < 0.1% |
| -0.256808 | 3 | < 0.1% |
| 0.305831 | 2 | < 0.1% |
| -0.713478 | 2 | < 0.1% |
| 0.611157 | 2 | < 0.1% |
| -0.06147 | 2 | < 0.1% |
| Other values (10093) | 10411 |
| Value | Count | Frequency (%) |
| -4.090167 | 1 | |
| -3.97915 | 1 | |
| -3.929601 | 1 | |
| -3.915836 | 1 | |
| -3.903274 | 1 | |
| -3.882752 | 1 | |
| -3.861861 | 1 | |
| -3.818545 | 1 | |
| -3.814099 | 1 | |
| -3.775346 | 1 |
| Value | Count | Frequency (%) |
| 4.580832 | 1 | |
| 4.251197 | 1 | |
| 3.845958 | 1 | |
| 3.803399 | 1 | |
| 3.801536 | 1 | |
| 3.788158 | 1 | |
| 3.374999 | 1 | |
| 3.174224 | 1 | |
| 3.134442 | 1 | |
| 3.133587 | 1 |
| Distinct | 258 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.008690807608 |
| Minimum | -4.737863 |
|---|---|
| Maximum | 3.213413 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 4687 |
| Negative (%) | 44.9% |
| Memory size | 81.7 KiB |
Quantile statistics
| Minimum | -4.737863 |
|---|---|
| 5-th percentile | -1.993893 |
| Q1 | -0.372457 |
| median | 0.064084 |
| Q3 | 0.500624 |
| 95-th percentile | 1.560794 |
| Maximum | 3.213413 |
| Range | 7.951276 |
| Interquartile range (IQR) | 0.873081 |
Descriptive statistics
| Standard deviation | 1.001239548 |
|---|---|
| Coefficient of variation (CV) | -115.2067326 |
| Kurtosis | 2.371452235 |
| Mean | -0.008690807608 |
| Median Absolute Deviation (MAD) | 0.43654 |
| Skewness | -0.8400208036 |
| Sum | -90.705959 |
| Variance | 1.002480633 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.064084 | 245 | 2.3% |
| 0.001721 | 239 | 2.3% |
| -0.060642 | 238 | 2.3% |
| -0.123005 | 233 | 2.2% |
| 0.126447 | 228 | 2.2% |
| -0.029461 | 222 | 2.1% |
| 0.095265 | 214 | 2.1% |
| 0.18881 | 211 | 2.0% |
| 0.157628 | 205 | 2.0% |
| -0.154186 | 204 | 2.0% |
| Other values (248) | 8198 |
| Value | Count | Frequency (%) |
| -4.737863 | 3 | |
| -4.613137 | 1 | < 0.1% |
| -4.426048 | 1 | < 0.1% |
| -4.394866 | 1 | < 0.1% |
| -4.238959 | 2 | |
| -4.145415 | 1 | < 0.1% |
| -4.05187 | 2 | |
| -4.020689 | 1 | < 0.1% |
| -3.989507 | 1 | < 0.1% |
| -3.958326 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3.213413 | 1 | < 0.1% |
| 3.119868 | 1 | < 0.1% |
| 2.932779 | 1 | < 0.1% |
| 2.901598 | 1 | < 0.1% |
| 2.839235 | 3 | |
| 2.808053 | 3 | |
| 2.776872 | 3 | |
| 2.74569 | 1 | < 0.1% |
| 2.714509 | 3 | |
| 2.683327 | 3 |
| Distinct | 9947 |
|---|---|
| Distinct (%) | 95.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.0006784520456 |
| Minimum | -6.719324 |
|---|---|
| Maximum | 11.911338 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 5324 |
| Negative (%) | 51.0% |
| Memory size | 81.7 KiB |
Quantile statistics
| Minimum | -6.719324 |
|---|---|
| 5-th percentile | -1.3770446 |
| Q1 | -0.514199 |
| median | -0.020397 |
| Q3 | 0.526304 |
| 95-th percentile | 1.5429124 |
| Maximum | 11.911338 |
| Range | 18.630662 |
| Interquartile range (IQR) | 1.040503 |
Descriptive statistics
| Standard deviation | 0.9929277614 |
|---|---|
| Coefficient of variation (CV) | -1463.519445 |
| Kurtosis | 7.526004847 |
| Mean | -0.0006784520456 |
| Median Absolute Deviation (MAD) | 0.518129 |
| Skewness | -0.3967824436 |
| Sum | -7.081004 |
| Variance | 0.9859055394 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| -0.691759 | 12 | 0.1% |
| -6.719324 | 11 | 0.1% |
| -0.038166 | 7 | 0.1% |
| -0.554765 | 6 | 0.1% |
| -0.250722 | 6 | 0.1% |
| -0.189458 | 5 | < 0.1% |
| 1.317433 | 5 | < 0.1% |
| -0.397734 | 5 | < 0.1% |
| 0.688048 | 5 | < 0.1% |
| 0.179694 | 5 | < 0.1% |
| Other values (9937) | 10370 |
| Value | Count | Frequency (%) |
| -6.719324 | 11 | |
| -5.869281 | 1 | < 0.1% |
| -5.830644 | 1 | < 0.1% |
| -5.811561 | 1 | < 0.1% |
| -5.797456 | 1 | < 0.1% |
| -5.753371 | 1 | < 0.1% |
| -5.342652 | 1 | < 0.1% |
| -5.291744 | 1 | < 0.1% |
| -5.259038 | 1 | < 0.1% |
| -5.212433 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 11.911338 | 1 | |
| 7.900729 | 1 | |
| 7.654104 | 1 | |
| 4.391396 | 1 | |
| 4.382016 | 1 | |
| 4.084038 | 1 | |
| 4.052297 | 1 | |
| 4.013347 | 1 | |
| 3.964041 | 1 | |
| 3.896388 | 1 |
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 591.3 KiB |
| A | |
|---|---|
| F | |
| E | |
| I | |
| X | |
| Other values (7) |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 10437 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | W |
|---|---|
| 2nd row | A |
| 3rd row | I |
| 4th row | E |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| A | 4286 | |
| F | 1962 | |
| E | 1095 | 10.5% |
| I | 832 | 8.0% |
| X | 522 | 5.0% |
| H | 520 | 5.0% |
| G | 447 | 4.3% |
| D | 353 | 3.4% |
| Y | 267 | 2.6% |
| C | 103 | 1.0% |
| Other values (2) | 50 | 0.5% |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| a | 4286 | |
| f | 1962 | |
| e | 1095 | 10.5% |
| i | 832 | 8.0% |
| x | 522 | 5.0% |
| h | 520 | 5.0% |
| g | 447 | 4.3% |
| d | 353 | 3.4% |
| y | 267 | 2.6% |
| c | 103 | 1.0% |
| Other values (2) | 50 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 4286 | |
| F | 1962 | |
| E | 1095 | 10.5% |
| I | 832 | 8.0% |
| X | 522 | 5.0% |
| H | 520 | 5.0% |
| G | 447 | 4.3% |
| D | 353 | 3.4% |
| Y | 267 | 2.6% |
| C | 103 | 1.0% |
| Other values (2) | 50 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10437 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4286 | |
| F | 1962 | |
| E | 1095 | 10.5% |
| I | 832 | 8.0% |
| X | 522 | 5.0% |
| H | 520 | 5.0% |
| G | 447 | 4.3% |
| D | 353 | 3.4% |
| Y | 267 | 2.6% |
| C | 103 | 1.0% |
| Other values (2) | 50 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10437 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 4286 | |
| F | 1962 | |
| E | 1095 | 10.5% |
| I | 832 | 8.0% |
| X | 522 | 5.0% |
| H | 520 | 5.0% |
| G | 447 | 4.3% |
| D | 353 | 3.4% |
| Y | 267 | 2.6% |
| C | 103 | 1.0% |
| Other values (2) | 50 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10437 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 4286 | |
| F | 1962 | |
| E | 1095 | 10.5% |
| I | 832 | 8.0% |
| X | 522 | 5.0% |
| H | 520 | 5.0% |
| G | 447 | 4.3% |
| D | 353 | 3.4% |
| Y | 267 | 2.6% |
| C | 103 | 1.0% |
| Other values (2) | 50 | 0.5% |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| intercolumnar_distance | upper_margin | lower_margin | exploitation | row_number | modular_ratio | interlinear_spacing | weight | peak_number | ratio | class | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | -3.498799 | 0.250492 | 0.232070 | 1.224178 | -4.922215 | 1.145386 | 0.182426 | -0.165983 | -0.123005 | 1.087144 | W |
| 1 | 0.204355 | -0.354049 | 0.320980 | 0.410166 | -0.989576 | -2.218127 | 0.220177 | 0.181844 | 2.090879 | -2.009758 | A |
| 2 | 0.759828 | -1.304042 | -0.023991 | -0.973663 | -0.006417 | -0.349509 | -0.421580 | -0.450127 | 0.469443 | 0.060952 | I |
| 3 | -0.005490 | 0.360409 | 0.281860 | -0.213479 | -1.168333 | -1.013906 | -0.346080 | 1.176165 | 0.968347 | -0.627999 | E |
| 4 | 0.080916 | 0.101320 | 0.104040 | 0.140490 | 0.261718 | 0.480988 | 0.710932 | -0.253430 | -0.497183 | 0.155681 | A |
| 5 | 0.068573 | -0.181323 | -3.210528 | -0.294311 | -1.168333 | 0.356414 | -0.006326 | -0.219550 | 0.126447 | 0.448186 | F |
| 6 | -0.301743 | -0.314793 | 0.399221 | 0.770520 | 0.708609 | 0.564038 | -1.403091 | -1.459107 | -0.091823 | 1.627420 | Y |
| 7 | 0.031541 | -0.118513 | 0.374326 | -0.066706 | 0.261718 | 0.605563 | 0.559930 | -0.258129 | 0.095265 | 0.344766 | A |
| 8 | -0.091897 | -0.118513 | 0.189393 | 1.280303 | 0.261718 | 0.314889 | 0.069175 | 1.277183 | 0.531806 | 0.359002 | A |
| 9 | 0.377169 | 0.014957 | 0.381439 | 0.292753 | 0.261718 | -0.307984 | 0.522180 | 0.370989 | 0.562987 | -0.440132 | H |
Last rows
| intercolumnar_distance | upper_margin | lower_margin | exploitation | row_number | modular_ratio | interlinear_spacing | weight | peak_number | ratio | class | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 10427 | 0.105604 | -0.149918 | 0.438342 | 0.934659 | 0.261718 | 0.273364 | 0.182426 | 0.722718 | 0.126447 | 0.263718 | A |
| 10428 | -0.190648 | -1.092060 | 0.402778 | 0.749425 | 0.172340 | 0.771662 | 0.559930 | 0.181727 | 0.407080 | 0.492670 | D |
| 10429 | -0.017834 | 0.234790 | -0.180472 | 0.621032 | 0.261718 | 0.439463 | 0.673182 | -0.262982 | 0.064084 | 0.140600 | F |
| 10430 | 0.661077 | -0.094960 | 0.015130 | 0.751961 | 0.976743 | -0.100360 | -1.403091 | 1.037054 | 1.436069 | 0.926642 | I |
| 10431 | -0.437525 | 0.423218 | 0.388552 | 0.620852 | 0.172340 | -0.889332 | 0.144676 | 2.131034 | 1.373706 | -0.771516 | E |
| 10432 | -0.128929 | -0.040001 | 0.057807 | 0.557894 | 0.261718 | -0.930856 | -0.044076 | 1.158458 | 2.277968 | -0.699884 | X |
| 10433 | 0.266074 | 0.556689 | -0.020434 | 0.176624 | 0.261718 | -0.515608 | 0.597681 | 0.178349 | 0.625350 | -0.657245 | G |
| 10434 | -0.054866 | 0.580242 | 0.032912 | -0.016668 | 0.261718 | 1.519109 | 0.371178 | -0.985508 | -0.403638 | 1.276301 | A |
| 10435 | 0.080916 | 0.588093 | 0.015130 | 0.002250 | 0.261718 | -0.930856 | -0.270579 | 0.163807 | -0.091823 | -0.593329 | F |
| 10436 | 0.377169 | 0.014957 | 0.381439 | 0.292753 | 0.261718 | -1.470679 | -0.006326 | -0.494919 | -0.247731 | -1.212974 | H |